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<! — StartFragment — > RESULT 1 
AGU69156 

LOCUS AGU69156 681 bp mRNA linear PLN 05-JAN-1999 

DEFINITION Alnus glutinosa actinorhizal nodulin AgNOD-GHRP (AgNt84) mRNA, 

complete cds . 
ACCESSION U69156 
VERSION U69156.1 GI:4097819 

KEYWORDS 

SOURCE Alnus glutinosa 

ORGANISM Alnus glutinosa 

Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta ; 
Spermatophyta; Magnoliophyta; eudicotyledons ; core eudicotyledons ; 
rosids; eurosids I; Fagales; Betulaceae; Alnus. 
REFERENCE 1 (bases 1 to 681) 
AUTHORS Twigg,P.G. 

TITLE Isolation of a nodule-specific cDNA encoding a putative 

glycine-rich protein from Alnus glutinosa 
JOURNAL Thesis (1993) The University of Tennessee, Knoxville, TN, USA 
REFERENCE 2 (bases 1 to 681) 

AUTHORS Dobritsa, S.V. and Mullin,B.C. 

TITLE In vitro expression of actinorhizal nodulin AgNOD-GHRP and 

demonstration of its toxicity ot Escherichia coli 

JOURNAL (in) Stacey,G., Mullin,B.C. and Gresshof f , P .M. (Eds.); 

THE BIOLOGY OF PLANT-MICROBE INTERACTIONS: PRECEEDINGS OF THE 8TH 
INTERNATIONAL SYMPOSIUM ON MOLECULAR PLANT-MICROBE INTERACTIONS; 
(1996) In press 
REFERENCE 3 (bases 1 to 681) 

AUTHORS Pawlowski,K. , Twigg,P.G., Dobritsa, S .V. , Guan,C. and Mullin,B.C. 

TITLE A nodule-specific gene family from Alnus glutinosa encodes glycine 

and histidine-rich proteins expressed in the early stages of 
actinorhizal nodule development 

JOURNAL Unpublished (1996) 
REFERENCE 4 (bases 1 to 681) 

AUTHORS Twigg,P.G. and Mullin,B.C. 

TITLE Direct Submission 

JOURNAL Submitted ( 03-SEP-1996 ) Botany, University of Tennessee, 437 Hesler 
Biology Building, Knoxville, TN 37996, USA 
FEATURES Location/Qualifiers 
source 1. .681 

/organism="Alnus glutinosa" 
/mol_type=" mRNA " 
/db_xref="taxon:3517" 
/tissue_type="root nodules" 
gene 1. .681 

/gene="agNt84" 
CDS 74. .373 

/gene="agNt84" 

/ not e=" glycine /histidine-rich protein" 
/codon_start=l 

/product="actinorizal nodulin AgNOD-GHRP" 
/protein_id="AAD00171 .1" 
/db_xref ="GI : 4097820" 

/translat ion="MGYSKTFLLLGLAFAVVLLI SSDVSASELAVAAQTKENMQTDGV 
EE DKYHGHRHVHGHGHGHVHGNGNEHGHGHHHGRGHPGHGAAADE TETETETNQN" 



Alignment Scores: 

Pred. No.: 1.61e-50 Length: 

Score: 543.00 Matches: 

Percent Similarity: 100.0% Conservative: 

Best Local Similarity: 100.0% Mismatches: 
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Query Match: 
DB: 



Indels : 
Gaps : 



US-10-566-598-1 (1-99) x AGU69156 (1-681) 

Qy 1 MetGlyTyrSerLysThrPheLeuLeuLeuGlyLeuAlaPheAlaValValLeuLeuIle 20 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 74 ATGGGTTACTCCAAGACTTTTCTTCTCCTTGGCCTTGCCTTTGCTGTTGTGCTCCTCATC 133 

Qy 21 SerSerAspValSerAlaSerGluLeuAlaValAlaAlaGlnThrLysGluAsnMetGln 40 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 134 TCCTCCGATGTCTCAGCTTCTGAGCTTGCTGTTGCCGCTCAAACCAAGGAGAATATGCAA 193 

Qy 41 ThrAspGlyValGluGluAspLysTyrHisGlyHisArgHisValHisGlyHisGlyHis 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 194 ACTGACGGTGTGGAGGAGGATAAGTATCATGGCCATCGTCACGTGCATGGACATGGGCAT 253 

Qy 61 GlyHisValHisGlyAsnGlyAsnGluHisGlyHisGlyHisHisHisGlyArgGlyHis 80 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 25 4 GGACATGTACATGGGAATGGGAATGAACATGGACATGGTCATCACCACGGCCGTGGTCAC 313 



Db 



81 ProGlyHisGlyAlaAlaAlaAspGluThrGluThrGluThrGluThrAsnGlnAsn 99 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
314 C C AGGAC AC GGTGCTGCT GC AGAC G AGAC AGAAAC C G AAAC T G AAAC C AAC C AAAAT 370 



RESULT 2 

Y08436 

LOCUS 

DEFINITION 
ACCESSION 
VERSION 
KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 



JOURNAL 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 



gene 
CDS 



Y08436 687 bp mRNA linear PLN 22-SEP-1997 

A.glutinosa mRNA for Agl64 protein. 

Y08436 

Y08436.1 GI:2437816 

agl64 gene; glycine-rich protein; histidine-r ich protein; 
nodule-specific protein. 
Alnus glutinosa 
Alnus glutinosa 

Eukaryota; Viridiplantae ; Streptophyta; Embryophyta ; Tracheophyta ; 
Spermatophyta; Magnoliophyta; eudicotyledons ; core eudicotyledons ; 
rosids; eurosids I; Fagales; Betulaceae; Alnus. 

1 (bases 1 to 687) 

Pawlowski,K. , Twigg,P., Dobritsa,S., Guan,C. and Mullin,B.C. 
A nodule-specific gene family from Alnus glutinosa encodes glycine- 
and histidine-rich proteins expressed in the early stages of 
actinorhizal nodule development 

Mol. Plant Microbe Interact. 10 (5), 656-664 (1997) 
9204569 

2 (bases 1 to 687) 
Pawlowski, K. 
Direct Submission 

Submitted ( 27-SEP-1996 ) K. Pawlowski, Dept. Molecular Biology, 
Agricultural University Wageningen, Dreijenlaan 3, 6703 HA 
Wageningen, NETHERLANDS 

Location /Qualifiers 

1. .687 

/organism="Alnus glutinosa" 
/mol_type=" mRNA " 
/db_xref="taxon:3517" 
/clone_lib="lambda ZAP II" 
/dev_stage="root nodule" 
1. .687 
/gene="agl64" 
45. .305 
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sig_peptide 
mat_peptide 



ORIGIN 



/gene="agl64" 
/codon_start=l 
/protein_id="CAA69708. 1" 
/db_xref ="GI : 2437817" 
/db_xref="InterPro: IPR0108 00" 
/db_xref="UniProtKB/TrEMBL:0226 71" 

/translat ion="MGYFNTFLLLGLVFAVVLLI SSDVSAGELVGATQTKENMQTDGA 

EEVKYHGHRHVHGHGHGNGHGHHGHGHSGHWAADETETDRN" 

45. .131 

/gene="agl64" 

132. .302 

/gene="agl64" 

/product =" unnamed" 



Alignment Scores: 
Pred. No . : 
Score : 

Percent Similarity: 
Best Local Similarity: 
Query Match: 
DB: 



1 .09e-29 

354.50 

73.7% 

70.7% 

65.3% 



Length : 
Matches : 
Conservative : 
Mismatches : 
Indels : 
Gaps : 



US-10-566-598-1 (1-99) x Y08436 (1-687) 

Qy 1 MetGlyTyrSerLysThrPheLeuLeuLeuGlyLeuAlaPheAlaValValLeuLeuIle 20 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 45 ATGGGTTACTTCAACACTTTCCTTCTCCTTGGCCTAGTCTTTGCCGTTGTGCTCCTCATC 104 

Qy 21 SerSerAspValSerAlaSerGluLeuAlaValAlaAlaGlnThrLysGluAsnMetGln 40 

I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I 

Db 105 TCT TCCGATGTCTCAGCTGGTGAGCTTGTTGGTGCCACTCAAACC AAGGAGAATATGCAA 16 4 

Qy 41 ThrAspGlyValGluGluAspLysTyrHisGlyHisArgHisValHisGlyHisGlyHis 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 165 ACTGATGGTGCGGAGGAGGTCAAGTATCATGGCCATCGTCACGTGCATGGACATGGGCAT 22 4 

Qy 61 GlyHisValHisGlyAsnGlyAsnGluHisGlyHisGlyHisHisHisGlyArgGlyHis 80 

III::: I I I I I I I I I I I I I I I I I I I I I 

Db 225 GGGAATGGACATGGA CACCACGGCCATGGTCAC 25 7 



81 ProGlyHisGlyAlaAlaAlaAspGluThrGluThrGluThrGluThrAsnGlnAsn 99 
I I I I I I I I I I I I I I I I I I I I I I I I I I I :: : I I I I I I : : : 

258 TCGGGACATGTTGTTGCTGCTGATGAGACAGAAACTGATCGAAATTAAAACCAATCA 314 
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